BRANCH 



CRF Problem Report 



The Scientific and Technical Information Center (STIC) experienced a problem 
when processing the following computer readable form (CRF): 




Application Serial Number: 
Filing Date: 

Date Processed by STIC : 



STIC Contact: Mark Spencer, 703-308-4212 
Nature of Problem: 



RECEIVED 

JAN 0 % 2003 

TECH CENTER 1600/2900 



The CRF (was): 



(circle one) Damaged or Unreadable (for Unreadable, see attached) 
Blank (no files on CRF) (see attached) 

Empty file (filename present, but no bytes in file) (see attached) 

Virus-infected. Virus name: The STIC will not process the CRF. 

Not saved in ASCII text 

Sequence Listing was embedded in the file. According to Sequence Rules, 
submitted file should only be the Sequence Listing. 
Did not contain a Sequence Listing (see attached sample) 
Other: 



PLEASE USE THE CHECKER VERSION 3.1 PROGRAM TO REDUCE ERRORS. 
SEE BELOW FOR ADDRESS: 

http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
2011 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202 



Revised 01/29/2002 



w— > 
w— > 
w — > 
w — > 

w~> 
w — > 
c — > 
w--> 
w — > 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/787,559B 

DATE: 12/19/2002 
TIME: 08:59:57 

Input Set : A:\400.001-6seq.WorkFile.txt 
Output Set: N:\CRF4\12192002\l787559B.raw 



09/787, 559B 



1 Application Project 
2 

3 
0 
4 
5 
6 
8 

9 

10 <150> PRIOR APPLICATION NUMBER: PriorAppNumber : DE 198 42 863 
W— > 11 <151> PRIOR FILING DATE: PriorFilingDate : 1998-09-19 
W — > 13 Sequence 
W — > 14 



<120> TITLE OF INVENTION: Title 
<110> APPLICANT: 

<130> FILE REFERENCE: AppFileReference : KM1/PCT 
<140> CURRENT APPLICATION NUMBER: CurrentAppNumber 
<141> CURRENT FILING DATE: 2001-06-18 
Earlier Applications 



Regulatory protein from human keratinocytes 



ERRORED SEQUENCES 

15 <213> ORGANISM: OrganismName : Homo sapiens 
E— > 0 <160> NUMBER OF SEQ ID NOS : 
E--> 16 <210> SEQ ID NO: 
E— > 16 <211> LENGTH: 
E~> 16 <212> TYPE: 

W~> 16 <400> SEQUENCE: PreSequenceString : 



17 ggcacccagg tgcgcgcgga gccatggtta tcatgtcgga gttcagcgcg gaccccgcgg 60 

18 gccagagtca gggccagcag aagcccctcc gggtgggttt ttacgacatc gagcggaccc 120 

19 tgggcaaagg caacttcgcg gtggtgaagc tggcgcggca tcgagtcacc aaaacgcagg 180 

20 ttgcaataaa aataattgat aaaacacgat tagattcaag caatttggag aaaatctatc 240 

21 gtgaggttca gctgatgaag cttctgaacc atccacacat cataaagctt taccaggtta 300 

22 tggaaacaaa ggacatgctt tacatcgtca ctgaatttgc taaaaatgga gaaatgtttg 360 

23 attatttgac ttccaacggg cacctgagtg agaacgaggc gcggaagaag ttctggcaaa 420 

24 tcctgtcggc cgtggagtac tgtcacgacc atcacatcgt ccaccgggac ctcaagaccg 480 

25 agaacctcct gctggatggc aacatggaca tcaagctggc agattttgga tttgggaatt 540 

26 tctacaagtc aggagagcct ctgtccacgt ggtgtgggag ccccccgtat gccgccccgg 600 

27 aagtctttga ggggaaggag tatgaaggcc cccagctgga catctggagc ctgggcgtgg 660 

28 tgctgtacgt cctggtctgc ggttctctcc ccttcgatgg gcctaacctg ccgacgctga 720 

29 gacagcgggt gctggagggc cgcttccgca tccccttctt catgtctcaa gactgtgaga 780 

30 gcctgatccg ccgcatgctg gtggtggacc ccgccaggcg catcaccatc gcccagatcc 840 

31 ggcagcaccg gtggatgcgg gctgagccct gcttgccggg acccgcctgc cccgccttct 900 

32 ccgcacacag ctacacctcc aacctgggcg actacgatga gcaggcgctg ggtatcatgc 960 

33 agaccctggg cgtggaccgg cagaggacgg tggagtcact gcaaaacagc agctataacc 1020 

34 actttgctgc catttattac ctcctccttg agcggctcaa ggagtatcgg aatgcccagt 1080 

35 gcgcccgccc cgggcctgcc aggcagccgc ggcctcggag ctcggacctc agtggtttgg 1140 

36 aggtgcctca ggaaggtctt tccaccgacc ctttccgacc tgccttgctg tgcccgcagc 1200 






E — > 
E — > 
W — > 
W — > 
w — > 
w— > 

E— > 
W — > 
E--> 
E — > 
E — > 
E — > 

E — > 



37 cgcagacctt ggtgcagtcc gtcctccagg ccgagatgga ctgtgagctc cagagctcgc 1260 
38 tgcagtggcc cttgttcttc ccggtggatg ccagctgcag cggagtgttc cggccccggc 1320 
39 ccgtgtcccc aagcagcctg ctggacacag ccatcagtga ggaggccagg caggggccgg 1380 
40 gcctagagga ggagcaggac acgcaggagt ccctgcccag cagcacgggc cggaggcaca 1440 
41 ccctggccga ggtctccacc cgcctctccc cactcaccgc gccatgtata gtcgtctccc 1500 
42 cctccaccac ggcaagtcct gcagagggaa ccagctctga cagttgtctg accttctctg 1560 
43 cgagcaaaag ccccgcgggg ctcagtggca ccccggccac tcaggggctg ctgggcgcct 1620 
44 gctccccggt caggctggcc tcgcccttcc tggggtcgca gtccgccacc ccagtgctgc 1680 
45 aggctcaggg gggcttggga ggagctgttc tgctccctgt cagcttccag gagggacggc 1740 
46 gggcgtcgga cacctcactg actcaagggc tgaaggcctt tcggcagcag ctgaggaaga 1800 
47 ccacgcggac caaagggttt ctgggactga acaaaatcaa ggggctggct cgccaggtgt 1860 
48 gccaggtccc tgccagccgg gccagcaggg gcggcctgag ccccttccac gcccctgcac 1920 
49 agagcccagg cctgcacggc ggcgcagccg gcagccggga gggctggagc ctgctggagg 1980 
50 aggtgctaga gcagcagagg ctgctccagt tacagcacca cccggccgct gcacccggct 2040 
51 gctcccaggc cccccagccg gcccctgccc cgtttgtgat cgccccctgt gatggccctg 2100 
52 gggctgcccc gctccccagc accctcctca cgtcggggct cccgctgctg ccgcccccac 2160 
53 tcctgcagac cggcgcgtcc cccgtggcct cagcggcgca gctcctggac acacacctgc 2220 
54 acattggcac cggccccacc gccctccccg ctgtgccccc accacgcctg gccaggctgg 2280 
55 ccccaggttg tgagcccctg gggctgctgc agggggactg tgagatggag gacctgatgc 2340 
56 cctgctccct aggcacgttt gtcctggtgc agtgagggca gccctgcatc ctggcacgga 2400 
57 cactgactct tacagcaata acttcagagg aggtgaagac atctggcctc aaagccaaga 2460 
58 actttctaga agcgaaataa gcaatacgtt aggtgttttg gcgaaaaaaa aaaaaaaaaa 2520 
59 aaaaaaaaaa aaa 2533 

60 <212> TYPE: Type : DNA 
61 <211> LENGTH: Length : 2533 
62 SequenceName : SEQ ID NO: 1 
63 SequenceDescription : 
65 Sequence 
66 

E--> 
E--> 
E--> 
W--> 
W--> 

67 <213> ORGANISM: OrganismName : Homo sapiens 
68 <210> SEQ ID NO: 
68 <400> SEQUENCE: PreSequenceString : 

69 hPgargamvI msEFsadPag QsQgQQkPLr vgFydlErtL gkgnFawkL arhrvtktQv 60 
70 alklldktrL dssnLEklyr EvQLmkLLnh PhllkLyQvm EtkdmLylvt EFakngEmFd 120 
71 yLtsnghLsE nEarkkFwQI LsavEychdh hlvhrdLktE nLLLdgnmdI kLadFgFgnF 180 
72 yksgEPLstw cgsPPyaaPE vFEgkEyEgP QLdlwsLgw LyvLvcgsLP FdgPnLPtLr 240 
73 QrvLEgrFrl PFFmsQdcEs LIrrmLvvdP arrltlaQIr QhrwmraEPc LPgPacPaFs 300 
74 ahsytsnLgd ydEQaLglmQ tLgvdrQrtv EsLQnssynh FaalyyLLLE rLkEyrnaQc 360 
75 arPgParQPr PrssdLsgLE vPQEgLstdP FrPaLLcPQP QtLvQsvLQa EmdcELQssL 420 
76 QwPLFFPvda scsgvFrPrP vsPssLLdta IsEEarQgPg LEEEQdtQEs LPsstgrrht 480 
77 LaEvstrLsP LtaPcIvvsP sttasPaEgt ssdscLtFsa sksPagLsgt PatQgLLgac 540 
78 sPvrLasPFL gsQsatPvLQ aQggLggavL LPvsFQEgrr asdtsLtQgL kaFrQQLrkt 600 
79 trtkgFLgLn klkgLarQvc QvPasrasrg gLsPFhaPaQ sPgLhggaag srEgwsLLEE 660 
80 vLEQQrLLQL QhhPaaaPgc sQaPQPaPaP FvIaPcdgPg aaPLPstLLt sgLPLLPPPL 720 
81 LQtgasPvas aaQLLdthLh IgtgPtaLPa vPPPrLarLa PgcEPLgLLQ gdcEmEdLmP 780 
82 csLgtFvLvQ 790 

83 <212> TYPE: Type : PRT 
84 <211> LENGTH: Length : 790 
85 SequenceName : SEQ ID NO: 2 



W — > 86 SequenceDescription : 
W — > 88 Sequence 
W— > 89 

90 <213> ORGANISM: OrganismName : Homo sapiens 
E — > 91 <210> SEQ ID NO: 

W--> 91 <400> SEQUENCE: PreSequenceString : 



E — > 92 PEAAAAAAAA GAVGTRAAPA AERAASWPGR SGGGGGARGA MVIMSEFSAD PAGQSQGQQK 60 

E — > 93 SLRVGFYDIE RTLGKGNFAV VKLARHRVTK TQVAIKIIDK TRLDSSNLEK IYREVQLMKL 120 

E--> 94 LNHPHIIKLY QVMETKDMLY IVTEFAKNGE MFDYLTSNGH LSENEARKKF WQILSAVEYC 180 

E--> 95 HDHHIVHRDL KTENLLLDGN MDIKLADFGF GNFYKSGEPL STWCGSPPYA APEVFEGKEY 240 

96 EGPQLDIWSL GVVLYVLVCG SLPFDGPNLP TLRQRVLEGR FRIPFFMSQD CESLIRRMLV 300 

E--> 97 VDPARRITIA QIRQHRWMRA EPCLPGPACP AFSAHSYTSN LGDYDEQALG IMQTLGVDRQ 360 

98 RTVESLQNSS YNHFAAIYYL LLERLKEYRN AQCARPGPAR QPRPRSSDLS GLEVPQEGLS 420 

99 TDPFRPALLC PQPQTLVQSV LQAEMDCELQ SSLQWPLFFP VDASCSGVFR PRPVSPSSLL 480 

100 DTAISEEARQ GPGLEEEQDT QESLPSSTGR RHTLAEVSTR LSPLTAPCIV VSPSTTASPA 540 

101 EGTSSDSCLT FSASKSPAGL SGTPATQGLL GACSPVRLAS PFLGSQSATP VLQAQGGLGG 600 
E--> 102 AVLLPVSFQE GRRASDTSLT QGLKAFRQQL RKTTRTKGFL GLNKIKGLAR QVCQVPASRA 660 

103 SRGGLSPFHA PAQSPGLHGG AAGSREGWSL LEEVLEQQRL LQLQHHPAAA PGCSQAPQPA 720 

104 PAPFVIAPCD GPGAAPLPST LLTSGLPLLP PPLLQTGASP VASAAQLLDT HLHIGTGPTA 780 
E — > 105 LPAVPPPRLA RLAPGCEPLG LLQGDCEMED LMPCSLGTFV LVQ 823 



E — > 106 <212> TYPE: Type : PRT 
W— > 107 <211> LENGTH: Length : 823 
W — > 108 SequenceName : SEQ ID NO: 3 
W — > 109 SequenceDescription : 
W — > 111 Sequence 

W— > 112 

113 <213> ORGANISM: OrganismName : Homo sapiens 
E — > 114 <210> SEQ ID NO: 

W--> 114 <400> SEQUENCE: PreSequenceString : 



E — > 115 ccccggaggc agcagcagcg gcggcggcag ccggagcagt aggcacccga gcagcgccag 60 

E — > 116 cggccgagcg ggcggcttcc tggcctgggc gctccggtgg cggcggaggt gcgcgcggag 120 

E — > 117 ccatggttat catgtcggag ttcagcgcgg accccgcggg ccagagtcag ggccagcaga 180 

E — > 118 agtccctccg ggtgggtttt tacgacatcg agcggaccct gggcaaaggc aacttcgcgg 240 

119 tggtgaagct ggcgcggcat cgagtcacca aaacgcaggt tgcaataaaa ataattgata 300 

E — > 120 aaacacgatt agattcaagc aatttggaga aaatctatcg tgaggttcag ctgatgaagc 360 

121 ttctgaacca tccacacatc ataaagcttt accaggttat ggaaacaaag gacatgcttt 420 

122 acatcgtcac tgaatttgct aaaaatggag aaatgtttga ttatttgact tccaacgggc 480 

123 acctgagtga gaacgaggcg cggaagaagt tctggcaaat cctgtcggcc gtggagtact 540 

124 gtcacgacca tcacatcgtc caccgggacc tcaagaccga gaacctcctg ctggatggca 600 
E — > 125 acatggacat caagctggca gattttggat ttgggaattt ctacaagtca ggagagcctc 660 

126 tgtccacgtg gtgtgggagc cccccgtatg ccgccccgga agtctttgag gggaaggagt 720 

127 atgaaggccc ccagctggac atctggagcc tgggcgtggt gctgtacgtc ctggtctgcg 780 
E — > 128 gttctctccc cttcgatggg cctaacctgc cgacgctgag acagcgggtg ctggagggcc 840 

129 gcttccgcat ccccttcttc atgtctcaag actgtgagag cctgatccgc cgcatgctgg 900 

130 tggtggaccc cgccaggcgc atcaccatcg cccagatccg gcagcaccgg tggatgcggg 960 

131 ctgagccctg cttgccggga cccgcctgcc ccgccttctc cgcacacagc tacacctcca 1020 

132 acctgggcga ctacgatgag caggcgctgg gtatcatgca gaccctgggc gtggaccggc 1080 

133 agaggacggt ggagtcactg caaaacagca gctataacca ctttgctgcc atttattacc 1140 

134 tcctccttga gcggctcaag gagtatcgga atgcccagtg cgcccgcccc gggcctgcca 1200 



135 ggcagccgcg gcctcggagc tcggacctca gtggtttgga ggtgcctcag gaaggtcttt 1260 
136 ccaccgaccc tttccgacct gccttgctgt gcccgcagcc gcagaccttg gtgcagtccg 1320 
137 tcctccaggc cgagatggac tgtgagctcc agagctcgct gcagtggccc ttgttcttcc 1380 
138 cggtggatgc cagctgcagc ggagtgttcc ggccccggcc cgtgtcccca agcagcctgc 1440 
139 tggacacagc catcagtgag gaggccaggc aggggccggg cctagaggag gagcaggaca 1500 
140 cgcaggagtc cctgcccagc agcacgggcc ggaggcacac cctggccgag gtctccaccc 1560 
141 gcctctcccc actcaccgcg ccatgtatag tcgtctcccc ctccaccacg gcaagtcctg 1620 
142 cagagggaac cagctctgac agttgtctga ccttctctgc gagcaaaagc cccgcggggc 1680 
143 tcagtggcac cccggccact caggggctgc tgggcgcctg ctccccggtc aggctggcct 1740 
144 cgcccttcct ggggtcgcag tccgccaccc cagtgctgca ggctcagggg ggcttgggag 1800 
145 gagctgttct gctccctgtc agcttccagg agggacggcg ggcgtcggac acctcactga 1860 
146 ctcaagggct gaaggccttt cggcagcagc tgaggaagac cacgcggacc aaagggtttc 1920 
147 tgggactgaa caaaatcaag gggctggctc gccaggtgtg ccaggtccct gccagccggg 1980 
148 ccagcagggg cggcctgagc cccttccacg cccctgcaca gagcccaggc ctgcacggcg 2040 
149 gcgcagccgg cagccgggag ggctggagcc tgctggagga ggtgctagag cagcagaggc 2100 
150 tgctccagtt acagcaccac ccggccgctg cacccggctg ctcccaggcc ccccagccgg 2160 
151 cccctgcccc gtttgtgatc gccccctgtg atggccctgg ggctgccccg ctccccagca 2220 
152 ccctcctcac gtcggggctc ccgctgctgc cgcccccact cctgcagacc ggcgcgtccc 2280 
153 cggtggcctc agcggcgcag ctcctggaca cacacctgca cattggcacc ggccccaccg 2340 
154 ccctccccgc tgtgccccca ccacgcctgg ccaggctggc cccaggttgt gagcccctgg 2400 
155 ggctgctgca gggggactgt gagatggagg acctgatgcc ctgctcccta ggcacgtttg 2460 
156 tcctggtgca gtgagggcag ccctgcatcc tggcacggac actgactctt acagcaataa 2520 
157 cttcagagga ggtgaagaca tctggcctca aagccaagaa ctttctagaa gcgaaataag 2580 
E--> 158 caatacgtta ggtgttttgg cgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2632 
E--> 160 <212> TYPE: Type : DNA 
W--> 161 <211> LENGTH: Length : 2632 
W--> 162 SequenceName : SEQ ID NO: 4 
W--> 163 SequenceDescription : 
W--> 165 Sequence 
W--> 166 

167 <213> ORGANISM: OrganismName : Homo sapiens 
E--> 168 <210> SEQ ID NO: 
W--> 168 <400> SEQUENCE: PreSequenceString : 
E--> 169 tgagcaggcg ctgggtatca tgcag 25 
E--> 170 <212> TYPE: Type : DNA 
W--> 171 <211> LENGTH: Length : 25 
W--> 172 SequenceName : SEQ ID NO: 5 
W--> 173 SequenceDescription : 
W--> 175 Sequence 
W--> 176 

177 <213> ORGANISM: OrganismName : Homo sapiens 
E--> 178 <210> SEQ ID NO: 
W--> 178 <400> SEQUENCE: PreSequenceString : 
E--> 179 tcaccgggaa caagaagggc cacct 25 
E--> 180 <212> TYPE: Type : DNA 
W--> 181 <211> LENGTH: Length : 25 
W--> 182 SequenceName : SEQ ID NO: 6 
W--> 183 SequenceDescription : 
<110> Smith, John; Smithgene Inc. 

<120> Example of a Sequence Listing 

<130> 01-00001 

<140> PCT/EP98/00001 

<141> 1998-12-31 

<150> US 08/999,999 

<151> 1997-10-15 

<160> 4 

<170> PatentIn version 2.0 

<210> 1 

<211> 389 

<212> DNA 

<213> Paramecium sp. 

<220> 

<221> CDS 

<222> (279)...(389) 

<300> 

<301> Doe, Richard 

<302> Isolation and Characterization of a Gene encoding a 

Protease from Paramecium sp. 

<303> Journal of Genes 

<304> 1 
<305> 4 

<306> 1-7 
<307> 1988-06-31 
<308> 123456 
<309> 1906-06-31 

<400> 1 

agcigiagic attcctgtgt cctcttctct ctgggcuci caccctgcta atcagacctc 60 

agggagagtg tcttgaccct cctccgcctt tgcagctica caggcaggcxi ggcaggcagc 120 

cgatgtggca attgetggea gtgecacagg cctctcagcc aggcttaggg tgggttccgc 180 

cgcggcgcgg cggcccctct cgcgctcctc tcgcgcctci ctctcgctct cctctcgctc 240 



Annex VII, page 30 

Appendix 3, page 2 

^cccc.ct aggtgagcag gaggaggggg cottage atg gtt tea atg tec age 296 

1 5 

ctg tct etc aaa tgg cct gga tct tgt ttg tec gtt tgt ttg tec caa 344 

Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val Cys Leu Phe Gln 

10 15 20 

tgt ccc aaa gtc etc ccc cgc cac tea tea ctg cag ccg aat ctx/ 389 
Cys Pro Lys Val Leu Pro Cys His Ser Ser Leu Gln Pro Asn Leu 
25 30 35 

<210> 2 

<211> 37 

<212> PRT 

<213> Paramecium sp. 

<400> 2 

Met Val Ser Met Phe Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu 
1 5 10 15 

Phe Val Cys Leu Phe Gln Cys Pro Lys Val Leu Pro Cys His Ser Ser 
20 25 30 

Leu Gln Pro Asn Leu 

<210> 3 

<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Designed peptide based on size and polarity to act as a 

linker between the alpha and beta chains of Protein XYZ. 

<400> 3 

Met Val Asn Leu Glu Pro Met His Thr Glu Ile 
1 5 10 

<210> 4 
<400> 4 
000 



[Annex VIII follows] 
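The nucleotide rows in the listings above share one layout: bases in groups of ten, sixty bases per line, and a running total at the end of each line. The following Python sketch reproduces that layout and checks a row's stated total against the bases actually present. The function names are hypothetical and are not part of the Checker program or any USPTO tool; the layout itself is inferred from the rows shown in this report.

```python
def format_nucleotide_rows(sequence, group=10, per_line=60):
    """Lay out bases in groups of `group`, `per_line` per row, with a running count."""
    rows = []
    for start in range(0, len(sequence), per_line):
        chunk = sequence[start:start + per_line]
        groups = [chunk[i:i + group] for i in range(0, len(chunk), group)]
        # The trailing number is the position of the last base on the row.
        rows.append(" ".join(groups) + " " + str(start + len(chunk)))
    return rows


def check_row(row, previous_total):
    """Return (matches, actual_total) for one formatted row.

    `previous_total` is the running count stated at the end of the prior row
    (0 for the first row of a sequence).
    """
    *groups, stated = row.split()
    actual = previous_total + sum(len(g) for g in groups)
    return actual == int(stated), actual


# SEQ ID NO: 5 from the listing above (25 bases on a single row).
for row in format_nucleotide_rows("tgagcaggcgctgggtatcatgcag"):
    print(row)
```

A row that has lost or gained a base shows up as a mismatch between the stated and recomputed totals, which is one quick way to spot transcription damage before resubmitting.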



table. The numeric identifier shall be used only in the "Sequence 
Listing." The order and presentation of the items of information in the 
"Sequence Listing" shall conform to the arrangement given below. Each 
item of information shall begin on a new line and shall begin with the 
numeric identifier enclosed in angle brackets as shown. The submission 
of those items of information designated with an "M" is mandatory. The 
submission of those items of information designated with an "O" is 
optional. Numeric identifiers <110> through <170> shall only be set 
forth at the beginning of the "Sequence Listing." The following table 
illustrates the numeric identifiers. 



Numeric Identifier | Definition | Comments and Format | Mandatory (M) or Optional (O) 

<110> | Applicant | Preferably max. of 10 names; one name per line; preferable format: Surname, Other Names and/or Initials | M 
<120> | Title of Invention | | M 
<130> | File Reference | Personal file reference | M, when filed prior to assignment of appl. number 
<140> | Current Application Number | Specify as: US 07/999,999 or PCT/US96/99999 | M, if available 
<141> | Current Filing Date | Specify as: yyyy-mm-dd | M, if available 
<150> | Prior Application Number | Specify as: US 07/999,999 or PCT/US96/99999 | M, if applicable include priority documents under 35 USC 119 and 120 
<151> | Prior Application Filing Date | Specify as: yyyy-mm-dd | M, if applicable 
<160> | Number of SEQ ID NOs | Count includes total number of SEQ ID NOs | 
<170> | Software | Name of software used to create the Sequence Listing | 
<210> | SEQ ID NO:#: | Response shall be an integer representing the SEQ ID NO shown | M 
<211> | Length | Respond with an integer expressing the number of bases or amino acid residues | M 
<212> | Type | Whether presented sequence molecule is DNA, RNA, or PRT (protein). If a nucleotide sequence contains both DNA and RNA fragments, the type shall be "DNA." In addition, the combined DNA/RNA molecule shall be further described in the <220> to <223> feature section. | M 
<213> | Organism | Scientific name, i.e. Genus/species, Unknown or Artificial Sequence. In addition, the "Unknown" or "Artificial Sequence" organisms shall be further described in the <220> to <223> feature section. | M 
<220> | Feature | Leave blank after <220>. <221-223> provide for a description of points of biological significance in the sequence. | M, under the following conditions: if "n," "Xaa," or a modified or unusual L-amino acid or modified base was used in a sequence; if ORGANISM is "Artificial Sequence" or "Unknown"; if molecule is combined DNA/RNA. 
<221> | Name/Key | Provide appropriate identifier for feature, preferably from WIPO Standard ST.25 (1998), Appendix 2, Tables 5 and 6 | M, under the following conditions: if "n," "Xaa," or a modified or unusual L-amino acid or modified base was used in a sequence 
<222> | Location | Specify location within sequence; where appropriate state number of first and last bases/amino acids in feature | M, under the following conditions: if "n," "Xaa," or a modified or unusual L-amino acid or modified base was used in a sequence 
<223> | Other Information | Other relevant information; four lines maximum | M, under the following conditions: if "n," "Xaa," or a modified or unusual L-amino acid or modified base was used in a sequence; if ORGANISM is "Artificial Sequence" or "Unknown"; if molecule is combined DNA/RNA 
<300> | Publication Information | Leave blank after <300> | 
<301> | Authors | Preferably max. of ten named authors of publication; specify one name per line; preferable format: Surname, Other Names and/or Initials | O 
<302> | Title | | 
<303> | Journal | | 
<304> | Volume | | 
<305> | Issue | | 
<306> | Pages | | 
<307> | Date | Journal date on which data published; specify as yyyy-mm-dd, MMM-yyyy or Season-yyyy | 
<308> | Database Accession Number | Accession number assigned by database including database name | 
<309> | Database Entry Date | Date of entry in database; specify as yyyy-mm-dd or MMM-yyyy | 
<310> | Patent Document Number | Document number; for patent-type citations only. Specify as, for example, US 07/999,999 | 
<311> | Patent Filing Date | Document filing date, for patent-type citations only; specify as yyyy-mm-dd | 
<312> | Publication Date | Document publication date, for patent-type citations only; specify as yyyy-mm-dd | 
<313> | Relevant Residues | FROM (position) TO (position) | 
<400> | Sequence | SEQ ID NO should follow the numeric identifier and should appear on the line preceding the actual sequence | 
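The rules above (each item on its own line, identifier in angle brackets, <110> through <170> only at the beginning) lend themselves to a mechanical pre-check. The Python sketch below is one possible way to do that; it is not the Checker program, and every function name in it is hypothetical. It treats only <110>, <120>, <210> and <211> as unconditionally mandatory, which is an assumption drawn from the entries marked plain "M" in the table; the conditional entries are deliberately left out.

```python
import re

# Assumption: only these identifiers are treated as unconditionally mandatory.
ALWAYS_MANDATORY = ("110", "120", "210", "211")

# An item starts a line with a three-digit identifier in angle brackets.
IDENTIFIER_LINE = re.compile(r"^<(\d{3})>\s*(.*)$")


def parse_identifiers(listing_text):
    """Return (identifier, value) pairs in order of appearance."""
    items = []
    for line in listing_text.splitlines():
        match = IDENTIFIER_LINE.match(line.strip())
        if match:
            items.append((match.group(1), match.group(2).strip()))
    return items


def missing_mandatory(items):
    """Return the always-mandatory identifiers that never appear."""
    seen = {identifier for identifier, _ in items}
    return [ident for ident in ALWAYS_MANDATORY if ident not in seen]


def misplaced_general_items(items):
    """Return <110>-<170> identifiers that appear after the first <210>."""
    misplaced = []
    in_sequence_part = False
    for identifier, _ in items:
        if identifier == "210":
            in_sequence_part = True
        elif in_sequence_part and 110 <= int(identifier) <= 170:
            misplaced.append(identifier)
    return misplaced


sample = """<110> Smith, John
<120> Example of a Sequence Listing
<210> 1
<211> 11
<212> PRT
<130> 01-00001
"""
items = parse_identifiers(sample)
print(missing_mandatory(items))
print(misplaced_general_items(items))
```

In the sample, <130> is placed after <210> on purpose so that the ordering check has something to report.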



5. Section 1.824 is revised to read as follows: 

1.824 Form and format for nucleotide and/or amino acid sequence 
submissions in computer readable form. 

(a) The computer readable form required by 1.821(e) shall meet the 
following specifications: 

(1) The computer readable form shall contain a single "Sequence Listing" 
as either a diskette, series of diskettes, or other permissible media 
outlined in paragraph (c) of this section. 

(2) The "Sequence Listing" in paragraph (a)(1) of this section shall be 
submitted in American Standard Code for Information Interchange (ASCII) 
text. No other formats shall be allowed. 

(3) The computer readable form may be created by any means, such as word 
processors, nucleotide/amino acid sequence editors or other custom 
computer programs; however, it shall conform to all specifications 
detailed in this section. 

(4) File compression is acceptable when using diskette media, so long as 
the compressed file is in a self-extracting format that will decompress 
on one of the systems described in paragraph (b) of this section. 

(5) Page numbering shall not appear within the computer readable form 
version of the "Sequence Listing" file. 

(6) All computer readable forms shall have a label permanently affixed 
thereto on which has been hand-printed or typed: the name of the 
applicant, the title of the invention, the date on which the data were 
recorded on the computer readable form, the operating system used, a 
reference number, and an application serial number and filing date, if 
known. 
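Paragraph (a)(2) requires plain ASCII text, and "Not saved in ASCII text" is one of the problems listed on the report form. A file can be screened for this before it is copied to the diskette. The Python sketch below is a minimal illustration under the assumption that "ASCII text" means every byte is 7-bit (value 0x7F or below); the function names are hypothetical, and the check does not attempt to detect word-processor formats that happen to use only 7-bit bytes.

```python
def find_non_ascii(data):
    """Return (line_number, column, byte_value) for every byte above 0x7F.

    `data` is the raw file content as bytes; line and column are 1-based.
    """
    problems = []
    for line_number, line in enumerate(data.split(b"\n"), start=1):
        for column, byte in enumerate(line, start=1):
            if byte > 0x7F:
                problems.append((line_number, column, byte))
    return problems


def is_ascii_text(data):
    """True when every byte in `data` is 7-bit ASCII."""
    return not find_non_ascii(data)


# Usage sketch with in-memory data; a real run would read the file in binary mode.
clean = b"<210> 1\n<211> 25\n"
damaged = "<213> Homo sapi\u00e9ns\n".encode("latin-1")
print(is_ascii_text(clean))
print(find_non_ascii(damaged))
```

Reporting the line and column, rather than a bare pass or fail, makes it easier to find the offending character in a long listing.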



(b) Computer readable form submissions must meet these format 
requirements: 

(1) Computer: IBM PC/XT/AT, or compatibles, or Apple Macintosh; 
(2) Operating System: MS-DOS, Unix or Macintosh; 



